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IN THE CLAIMS 

Please amend the claims as follows: 

1 . (Currently Amended) In a computer system having a plurality of processors connected 
across a network to a shared memory, a method of decoupling a write address from its 
corresponding write data in a store to the shared memory, comprising: 

generating a write request address for a memory write, wherein the write request address 
points to a memory location in the shared memory; 

transferring a write request to the shared memory, wherein the write request includes the 
write request address; 

noting the write request address in the shared memory; 

enforcing memory ordering comparing, in the shared memory, addresses in subsequent 
load and store requests to the write request address until the write data associated with the write 
request is written into the shared memory ; 

when the corresponding write data becomes available, transferring the write data to the 
shared memory in instruction order across the network without the write request address; 

pairing, within the shared memory, the write request address with the separately 
transferred corresponding write data; and 

storing the write data into the shared memory as a function of the write request address. 

2. (Original) The method according to claim 1 , wherein the shared memory includes a store 
address buffer and wherein noting the write request address includes writing the address in the 
store address buffer. 

3 . (Currently Amended) The method according to claim 2, further comprising comparing, 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein comparing addresses in the subsequent load and store requests includes stalling 
the subsequent read load requests to the write request address until the write data is written into 
the shared memory if there is a match . 
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4. (Currently Amended) The method according to claim 1, further comprising comparing, 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein the shared memory includes a cache, wherein noting the write request address 
includes changing a state in a cache line associated with the write request address to 
"WaitForData", and wherein comparing addresses in the subsequent load and store requests to 
the write request address includes accessing the cache and stalling if a cache line hit returns a 
"WaitForData" state. 

5. (Currently Amended) The method according to claim 1 , further comprising comparing, 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein the shared memory includes a bit vector, wherein noting the write request 
address in the shared memory includes setting one or more bits in the bit vector corresponding to 
the write request address, and wherein comparing addresses in the subsequent load and store 
requests to the write request address includes comparing bits that would be set corresponding to 
the load and store request addresses with the bits set for the write request address and stalling 
servicing of the load and store requests if there is a match. 

6. (Currently Amended) The method according to claim 1 , further comprising comparing, 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein comparing addresses in the subsequent load and store requests includes stalling 
the subsequent load requests to the write request address until the write data is written into the 
shared memory if there is a match . 

7. (Currently Amended) The method according to claim 6, wherein comparing addresses in 
the subsequent load and store requests includes servicing the load and store requests to addresses 
other than the write request address without waiting for the write data to be written to the write 
request address if there is no match . 
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8. (Currently Amended) The method according to claim 1 , further comprising comparing, 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein comparing addresses in the subsequent load and store requests includes 
servicing the load and store requests to addresses other than the write request address without 
waiting for the write data to be written to the write request address if there is no match . 

9. (Canceled) 

1 0. (Previously Presented) The method according to claim 1 , wherein transferring a write 
request includes ensuring that all vector and scalar loads from shared memory for that processor 
have been sent to the shared memory prior to issuing the write request. 

1 1 . (Previously Presented) In a computer system having a plurality of processors connected 
to a shared memory, a method of decoupling a write address from its corresponding write data in 
a write to the shared memory, comprising: 

generating a write request address for a memory write, wherein the write request address 
points to a memory location in shared memory; 

issuing a first write request to the shared memory, wherein the first write request includes 
the write request address; 

noting the write request address in the shared memory; 

comparing, in the shared memory, addresses in subsequent read and write requests to the 
write request address; 

stalling the subsequent read requests to the write request address until the write data 
corresponding to the first write request is written into the shared memory; and 

if the address in a subsequent write request matches the write request address stored in 
the shared memory and there are no stalled read requests to the write request address, discarding 
the first write request. 
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12. (Original) The method according to claim 11, wherein the shared memory includes a 
store address buffer and wherein noting the write request address includes writing the address in 
the store address buffer. 

13. (Original) The method according to claim 12, wherein comparing addresses in 
subsequent read and write requests includes stalling subsequent read requests to the write request 
address until the write data is written into the shared memory. 

14. (Previously Presented) The method according to claim 1 1 , wherein the shared memory 
includes a cache, wherein noting the write request address includes changing a state in a cache 
line associated with the write request address to "WaitForData", and wherein comparing 
addresses in subsequent read and write requests to the write request address includes accessing 
the cache and stalling if a cache line hit returns a "WaitForData" state. 

1 5 . (Previously Presented) The method according to claim 1 1 , wherein the shared memory 
includes a bit vector, wherein noting the write request address in the shared memory includes 
setting one or more bits in the bit vector corresponding to the write request address, and wherein 
comparing addresses in subsequent read and write requests to the write request address includes 
comparing bits that would be set corresponding to the load and store request addresses the bits 
set for the write request address and stalling servicing of the load and store requests if there is a 
match. 

1 6. (Previously Presented) The method according to claim 1 1 , wherein comparing addresses 
in subsequent read and write requests includes stalling the subsequent read requests to the write 
request address until the write data is written into the shared memory. 

17. (Previously Presented) The method according to claim 16, wherein comparing addresses 
in subsequent read and write requests includes servicing the read and write requests to addresses 
other than the write request address without waiting for the write data to be written to the write 
request address. 



AMENDMENT AND RESPONSE UNDER 37 C.F.R. § 1.1 16 - EXPEDITED PROCEDURE Page 7 

Serial Number: 10/643,742 Dkt: 1376.697USI 

Filing Date: August 18, 2003 

Title: DECOUPLING OF WRITE ADDRESS FROM ITS ASSOCIATED WRITE DATA IN A STORE TO A SHARED MEMORY IN A 
MULTIPROCESSOR SYSTEM (As Amended) 



18. (Previously Presented) The method according to claim 11, wherein comparing addresses 
in subsequent read and write requests includes servicing the read and write requests to addresses 
other than the write request address without waiting for the write data to be written to the write 
request address. 

19. (Previously Presented) The method according to claim 11, wherein comparing addresses 
in subsequent read and write requests includes enforcing memory ordering in the subsequent 
read and write requests to the write request address until the write data associated with the first 
write request is written into the shared memory. 

20. (Previously Presented) The method according to claim 1 1 , wherein issuing a write 
request includes ensuring that all vector and scalar loads from shared memory for that processor 
have been sent to the shared memory prior to issuing the write request. 

21 . (Currently Amended) In a computer system having a plurality of processors connected 
across a network to a shared memory, a method of decoupling a write address from its 
corresponding write data in a store to the shared memory, comprising: 

generating a write request address for a vector store to memory, wherein the write request 
address points to a memory location in the shared memory; 

transferring a vector store request to the shared memory, wherein the write request 
includes the write request address; 

noting the write request address to the shared memory; 

enforcing memory ordering comparing, in tho shared memory, addresses in subsequent 
load and store requests to the write request address until the write data associated with the write 
request is written into the shared memory ; 

when the corresponding write data becomes available, transferring the write data from a 
vector register to the shared memory in instruction order across the network without the write 
request address; 
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pairing, within the shared memory, the write request address with the separately 
transferred corresponding write data; and 

storing the write data into the shared memory as a function of the write request address. 

22. (Original) The method according to claim 2 1 , wherein the shared memory includes a 
store address buffer and wherein noting the write request address includes writing the address in 
the store address buffer. 

23. (Currently Amended) The method according to claim 22, further comprising comparing, 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein comparing addresses in the subsequent load and store requests includes stalling 
the subsequent load requests to the write request address until the write data is written into the 
shared memory if there is a match . 

24. (Currently Amended) The method according to claim 2 1 , further comp rising comparing, 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein the shared memory includes a cache, wherein noting the write request address 
includes changing a state in a cache line associated with the write request address to 
"WaitForData", and wherein comparing addresses in the subsequent load and store requests to 
the write request address includes accessing the cache and stalling if a cache line hit returns a 
"WaitForData" state. 

25 . (Currently Amended) The method according to claim 2 1 , further comprising comparing , 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein the shared memory includes a bit vector, wherein noting the write request 
address in the shared memory includes setting one or more bits in the bit vector corresponding to 
the write request address, and wherein comparing addresses in the subsequent load and store 
requests to the write request address includes comparing bits that would be set corresponding to 
the load and store request addresses with the bits set for the write request address and stalling 
servicing of the load and store requests if there is a match. 
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26. (Previously Presented) The method according to claim 2 1 , further comprising 
comparing, in the shared memory, addresses in the subsequent load and store requests to the 
write request address, wherein comparing addresses in the subsequent load and store requests 
includes stalling the subsequent load requests to the write request address until the write data is 
written into the shared memory if there is a match . 

27. (Currently Amended) The method according to claim 26, wherein comparing addresses 
in the subsequent load and store requests includes servicing the load and store requests to 
addresses other than the write request address without waiting for the write data to be written to 
the write request address if there is no match . 

28. (Currently Amended) The method according to claim 2 1 , further comprising comparing, 
in the shared memory, addresses in the subsequent load and store requests to the write request 
address, wherein comparing addresses in the subsequent load and store requests includes 
servicing the load and store requests to addresses other than the write request address without 
waiting for the write data to be written to the write request address if there is no match . 

29. (Canceled) 

30. (Previously Presented) The method according to claim 2 1 , wherein transferring a write 
request includes ensuring that all vector and scalar loads from shared memory for that processor 
have been sent to the shared memory prior to issuing the write request. 

3 1 . (Currently Amended) A method of decoupling vector data stores from vector instruction 
execution, comprising: 

executing a vector instruction on vector data stored in a vector register, wherein 
executing a vector instruction includes storing result vector data in the vector register; 
generating a vector write address for a vector store; 



AMENDMENT AND RESPONSE UNDER 37 C.F.R. § 1.1 16 - EXPEDITED PROCEDURE Page 1 0 

Serial Number: 10/643,742 Dkt: 1376.697US I 

Filing Date: August 1 8, 2003 

Title: DECOUPLING OF WRITE ADDRESS FROM ITS ASSOCIATED WRITE DATA IN A STORE TO A SHARED MEMORY IN A 
MULTIPROCESSOR SYSTEM (As Amended) 

transferring a vector store request across a network to memory, wherein the vector store 
request includes the vector write address; 

noting the vector write address in the memory; 

enforcing memory ordering comparing, in the memory, addresses in subsequent read and 
write requests to the vector write address until the result vector data associated with the vector 
store request is written into the memory ; 

when the corresponding write data becomes available, transferring the result vector data 
from the vector register to the memory in instruction order across the network without the vector 
write request address; 

pairing, within the memory, the vector write address with the separately transferred 
corresponding result vector data; and 

storing the result vector data into the memory as a function of the vector write address in 
the vector store request. 

32. (Currently Amended) The method according to claim 3 1 , further comprising comparing, 
in the memory, addresses in the subsequent read and write requests to the vector write request 
address, wherein comparing addresses in the subsequent read and write requests to the vector 
write address includes stalling the subsequent read requests to the vector write address until the 
result vector data is written into the memory. 

33. (Canceled) 

34. (Currently Amended) In a processor having a plurality of processing units connected 
across a network to a shared memory, a method of decoupling a write address from its 
corresponding write data in a write to the shared memory, comprising: 

generating a write request address for a memory write, wherein the write request address 
points to a memory location in the shared memory; 

transferring a write request to the shared memory, wherein the write request includes the 
write request address; 

storing the write request address in the shared memory; 
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comparing addresses in subsequent read and write requests to the write request address 
stored in the shared memory; 

enforcing memory ordering in the subsequent read and write requests to the write request 
address until the write data associated with the write request is written into the shared memory; 

when the corresponding write data becomes available, transferring the write data to the 
shared memory in instruction order across the network without the write request address; 

pairing, within the shared memory, the write request address with the separately 
transferred corresponding write data; and 

storing the corresponding write data into the shared memory as a function of the write 
request address. 

35. (Previously Presented) The method according to claim 34, wherein transferring a write 
request includes ensuring that all vector and scalar loads from shared memory for that processor 
have been sent to the shared memory prior to issuing the write request. 

36. (Currently Amended) The method according to claim 34. further comprisi ng comparing, 
in the shared memory, addresses in the subsequent read and write requests to the write request 
address, wherein comparing addresses in the subsequent read and write requests includes stalling 
the subsequent read requests to the write request address until the write data is written into the 
shared memory if there is a match . 

37. (Currently Amended) The method according to claim 34 36, wherein comparing 
addresses in the subsequent read and write requests includes enforcing memory ordering in the 
subsequent road and write requests to the write request address until the write data associated 
with the fir^t rp.qunr.t ir. writton into the r.harod memory servicing the read and write 
requests to addresses other than the write request address without waiting for the write data to be 
written to the write request address if there is no match . 



38. (Currently Amended) A computer system, comprising: 
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a plurality of processors, wherein the processors includes means for issuing a write 
address separate from data to be written to the write address; and 

a shared memory connected by a network to the plurality of processors, wherein the 
shared memory includes: 

means for receiving a first write request including a first write address; 
means for noting the first write request address to the shared memory; 
moans for comparing addresses in subsequent load and store requests to the first 
write address; 

means for stalling subsequent load and store requests to a memory location in the 
shared memory associated with the first write address until the data associated with the 
first write request is received and written by the shared memory; 

means for receiving write data in instruction order across the network without the 
write request address; and 

means for pairing the write request address with the separately transferred 
corresponding write data prior to storing the write data to the shared memory as a 
function of the write request address; 

wherein the processors enforce memory ordering in the subsequent load and store 
requests to the write request address until the write data associated with the first write request is 
written into the shared memory . 



